Localization and Recognition of Text with Perspective Distortion in Natural Scenes
نویسندگان
چکیده
Recognizing text in natural scene images refers to the problem of identifying words that present on it. Scene text recognition is very difficult due to some reasons such as, images contain very little amount of linguistic context, interpreting versions of letters and digits are required for scene text recognition and also scene text can appear in any orientation. Most of the existing works are focused on the recognition of texts which are frontal parallel to the image plane. We formulate a novel method which is used to recognize text in natural scene images which are perspectively distorted. Perspective distortion is avoided using Hough transform. Each character are recognized from cropped word image. Connected component analysis is used to detect the components that present on the cropped word image. Non text components are filtered using SVM classification. After that text components are recognized by Optical character recognition. We introduce a new dataset called Scene Text-Perspective, which contains scene images of the name boards placed in the road sides which are perspectively distorted. Experimental results on the proposed dataset shows that our method is simple and outperforms the existing methods.
منابع مشابه
Natural scene text localization using edge color signature
Localizing text regions in images taken from natural scenes is one of the challenging problems dueto variations in font, size, color and orientation of text. In this paper, we introduce a new concept socalled Edge Color Signature for localizing text regions in an image. This method is able to localizeboth Farsi and English texts. In the proposed method rst a pyramid using diff...
متن کاملFast perspective recovery of text in natural scenes
a r t i c l e i n f o Cheap, ubiquitous, high-resolution digital cameras have led to opportunities that demand camera-based text understanding , such as wearable computing or assistive technology. Perspective distortion is one of the main challenges for text recognition in camera captured images since the camera may often not have a fronto-parallel view of the text. We present a method for pers...
متن کاملرفع اعوجاج هندسی متون بهکمک اطلاعات هندسی خطوط متن
Document images produced by scanners or digital cameras usually have photometric and geometric distortions. If either of these effects distorts document, recognition of words from such a document image using OCR is subject to errors. In this paper we propose a novel approach to significantly remove geometric distortion from document images. In this method first we extract document lines from do...
متن کاملText detection and recognition in natural images
Natural scenes pose a major challenge to traditional optical character recognition methods because they often contain noise, occlusions, distortions, or relatively small amounts of highly styled text. In this work, we build a probabilistic system which unifies the tasks of text detection and recognition with a language model. We use an efficient multi-scale character detector to locate characte...
متن کاملCharacter Localization From Natural Images Using Nearest Neighbours Approach
Scene text contains significant and beneficial information. Extraction and localization of scene text is used in many applications. In this paper, we propose a connected component based method to extract text from natural images. The proposed method uses color space processing. Histogram analysis and geometrical properties are used for edge detection. Character recognition is done through OCR w...
متن کامل